Empirical Similarity ∗

نویسنده

  • David Schmeidler
چکیده

An agent is asked to assess a real-valued variable Yp based on certain characteristics Xp = (X1 p , ...,X m p ), and on a database consisting (X1 i , ...,X m i , Yi) for i = 1, ..., n. A possible approach to combine past observations of X and Y with the current values of X to generate an assessment of Y is similarity-weighted averaging. It suggests that the predicted value of Y , Ȳ s p , be the weighted average of all previously observed values Yi, where the weight of Yi, for every i = 1, ..., n, is the similarity between the vector X1 p , ...,X m p , associated with Yp, and the previously observed vector, X1 i , ...,X m i . We axiomatize this rule. We assume that, given every database, a predictor has a ranking over possible values, and we show that certain reasonable conditions on these rankings imply that they are determined by the proximity to a similarity-weighted average for a certain similarity function. The axiomatization does not suggest a particular similarity function, or even a particular functional form of this function. We therefore proceed to suggest that the similarity function be estimated from past observations. We develop tools of statistical inference for parametric estimation of the similarity function, for the case of a continuous as well as a discrete variable. Finally, we discuss the relationship of the proposed method to other methods of estimation and prediction. JEL Codes: C1, C8, D8

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

Evaluating Different Approaches to Measuring the Similarity of Melodies

This paper describes an empirical approach to evaluating similarity measures for the comparision of two note sequences or melodies. In the first sections the experimental approach and the empirical results of previous studies on melodic similarity are reported. In the discussion section several questions are raised that concern the nature of similarity or distance measures for melodies and musi...

متن کامل

On the definition of objective probabilities by empirical similarity

We suggest to define objective probabilities by similarity-weighted empirical frequencies, where more similar cases get a higher weight in the computation of frequencies. This formula is justified intuitively and axiomatically, but raises the question, which similarity function should be used? We propose to estimate the similarity function from the data, and thus obtain objective probabilities....

متن کامل

Wavelet-based confidence intervals for the self-similarity parameter

We propose and compare several methods of constructing wavelet-based confidence intervals for the self-similarity parameter in heavy-tailed observations. We use empirical coverage probabilities to assess the procedures by applying them to Linear Fractional Stable Motion with many choices of parameters. We find that the asymptotic confidence intervals provide empirical coverage often much lower ...

متن کامل

Vendor Selection: An Enhanced Hybrid Fuzzy MCDM Model

The objective of this article is to develop an empirically based framework for formulating and selecting a vendor in supply chain. This study applies the fuzzy set theory to evaluate the vendor selection decision. Applying Analytic Hierarchy Process (AHP) in obtaining criteria weights and applied Technique for Order Performance by Similarity to Idea Solution (TOPSIS) for obtaining final ranking...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004